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Description 

[0001] The invention relates to data processing sys- 
tems and in particular to a method and system for sup- 
porting speculative execution of program instructions, s 
Still more particularly, the invention relates to preserva- 
tion of non -conditional state information for recovery af- 
ter speculative execution fails. 
[0002] Designers of data processing systems are 
continually attempting to enhance the performance of 10 
such systems. One technique for enhancing data 
processing system efficiency is the achievement of 
short cycle times and a lowCycles-Per-lnstruction (CPI) 
ratio in the system processor. An example of the appli- 
cation of these techniques to data processing system is *5 
the International Business Machines Corporation RISC 
System/6000(RS/6000)computer. The RS/6000 system 
is designed to perform well in numerically intensive en- 
gineering and scientific applications as well as in multi- 
user, commercial environments. The RS/6000 proces- 20 
sor employs a superscalar implementation, which 
means that multiple instructions are issued and execut- 
ed concurrently. 

[0003] Processor architecture relates to the combina- 
tion of registers, arithmetic units and control logic to 25 
build the computational elements of a computer. An im- 
portant consideration during building of a processor is 
the instruction set it will provide. An instruction is a state- 
ment which specifies an operation and the values or lo- 
cations of its operands. An instruction set is the collec- 30 
tion of all such valid statements for a particular machine. 
[0004] As originally conceived, RISC machines would 
execute one instruction per machine cycle. To this end 
all instructions were of one length and fit a scheme com- 
patible with a pipeline implementation. Simplicity in the 35 
instruction set was the design objective. This allowed 
further reduction in the cycle time compared with so 
called complex instruction set computers (CISC). How- 
ever, some of the benefits of RISC were offset by in- 
creases in traffic between the processor and the main 40 
memory for a computer. This occurred because a RISC 
machine requires more instruction instances to do a task 
than a CISC machine with its more powerful instruction 
set. 

[0005] Concurrence in issuance and execution of <s 
multiple instructions requires independent functional 
units that can execute with a high instruction bandwidth. 
The RS/6000 system achieves this by utilizing separate 
branch, fixed point and floating point processing units 
which are pipelined in nature. The branch processing so 
unit handles conditional branch instructions. In common 
with other RISC designs, complex decoding logic no 
longer required to decode instructions has been utilized 
to provide an instruction cache on the processor chip. 
This reduces traffic between the processor and memory, 55 
and makes fetches of instructions extremely fast. 
[0006] An instruction subset of great interest is that 
relating to conditional branches. Conditional branch in- 



structions are instructions which dictate the taking of a 
specified conditional branch within an application in re- 
sponse to a selected outcome of the processing of one 
or more other instructions. A practical example is a For- 
tran do-loop. Conditional branch instructions have long 
been a source of difficulty for pipeline computers (includ- 
ing RISC systems). By the time a conditional branch in- 
struction propagates through a pipeline queue to an ex- 
ecution position within the queue, it will have been nec- 
essary to load instructions corresponding to one branch 
into the queue behind the conditional branch instruction 
prior to resolving the conditional branch, in order to 
avoid run-time delays. This requires a choice be made 
as to which instruction will follow the conditional branch 
without knowing the outcome of processing the related 
instructions. The choice can prove wrong. 
[0007] The execution of instructions prior to the final 
possible definition of all conditions effecting execution 
is called speculative execution. To wait for the outcome 
of conditional branches, or the arrival of all possible in- 
terrupts, would make full concurrent processing impos- 
sible. Thus, some scheme for processor recovery from 
speculative execution of instructions must be provided 
if full use of concurrent execution of instructions is to be 
made. Upon determination that execution is proceeding 
down an incorrect branch an interrupt may be generated 
to change the course of execution. In responding to an 
interrupt, the processor is returned to the last non- spec- 
ulative execution step. 

[0008] Experience has demonstrated that use .of 
some complex operations in RISC machines can im- 
prove performance. This in part stems from the nature 
of currently preferred technology for implementation of 
processors, i.e. very large scale integration (VLSI). Min- 
imization of area used on a chip is now more important 
than minimizing the number of devices used to imple- 
ment the processor. Hence, some complex instructions 
have begun infiltrating into Rl SC based designs. The cri- 
teria for inclusion is minimum utilization of space. One 
instruction in the RS/6000 instruction set allows execu- 
tion of a branch on count loop. The branch on count in- 
struction is a one step instruction replacing what was 
formerly done in three instructions. Substitution of a sin- 
gle instruction for three instructions was enabled by pro- 
viding a dedicated count register. However, this ar- 
rangement does not in itself support speculative execu- 
tion. Implementation of the count register could be done 
by a mechanism provided in RS/6000 machines for reg- 
ister rename, but the value for the count register would 
not be known during the dispatch cycle resulting in some 
loss of machine cycles. 

[0009] Desirable is a hardware implementation of the 
branch on count loop which uses a minimum amount of 
area on a processor chip. 

[001 0] It is therefore one object of the invention to pro- 
vide an improved method and system for supporting 
speculative execution of program instructions. 
[0011] It is another object of the invention to provide 
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preservation of conditional state information for recov- 
ery after speculative execution fails. 
[0012] The foregoing objects are achieved by the in- 
vention as claimed. The invention provides a data 
processing system for speculatively executing instruc- 
tions. The data processing system includes a memory 
for storing instructions at addresses which can be gen- 
erated by a branch unit in a processor The processor 
also has a count register for storing an update value, a 
dispatch version value and a completion version value 
of a branch control count. A fetcher connected to the 
branch unit fetches instructions from memory based up- 
on addresses calculated by the branch unit. The branch 
unit handles processing of conditional branch instruc- 
tions. To do so, means for initializing the update value 
and the dispatch version value for branch control are 
provided. Further included are means responsive to 
completion of initialization for copying the update value 
as the completion version value. The system further in- 
cludes means responsive to dispatch of a conditional 
branch instruction for examining the dispatch version 
value to determine if a branch should be taken and then 
decrementing the dispatch version value. Means re- 
sponsive to completion of the branch provide for decre- 
menting contents of a completion version register. Final- 
ly, means responsive to occurrence of an interrupt prior 
to completion of the branch provide for replacing the dis- 
patch version value with the completion version value 
to restore the system to a state prior to the speculative 
execution of instructions. 

[0013] The novel features believed characteristic of 
the invention are set forth in the appended claims. The 
invention itself however, as well as a preferred mode of 
use, further objects and advantages thereof, will best be 
understood by reference to the following detailed de- 
scription of an illustrative embodiment when read in con- 
junction with the accompanying drawings, wherein: 

Figure 1 is a high level block diagram of a super- 
scalar computer system which may be utilized to im- 
plement the method and system of the present in- 
vention; 

Figure 2 is a reservation table illustrating the ma- 
nipulation of instruction queue content in a prior art 
data processing system utilizing an instruction 
queue; and 

Figure 3 is a schematic illustration of a branch on 
count register architecture in accordance with a pre- 
ferred embodiment of the invention; and 
Figure 4 is a schematic illustration of a branch on 
count register architecture in accordance with a 
second preferred embodiment of the invention. 

[0014] With reference now to the figures and in par- 
ticu lar with reference to Figure 1 , there is depicted a high 
level block diagram of a superscalar computer system 
10 which may be utilized to implement the method and 
system of the present invention. As illustrated, computer 
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system to preferably includes a memory 18 which is uti- 
lized to store data, instructions and the like. Data or in- 
structions stored within memory 18 are preferably ac- 
cessed utilizing cache/memory interface 20 in a method 
5 well known to those having skill in the art. The sizing 
and utilization of cache memory systems is a well known 
subspecialty within the data processing art is not ad- 
dressed within the present application. However, those 
skilled in the art will appreciate that by utilizing modern 
associative cache techniques a large percentage of 
memory accesses may be achieved utilizing data tem- 
porarily stored within cache/memory interface 20. 
[0015] Instructions from cache/memory interface 20 
are typically loaded into instruction queue 22 which pref- 
erably includes a plurality of queue positions. In a typical 
embodiment of a superscalar computer system the in- 
struction queue may include eight queue positions and 
thus, in a given cycle, between zero and eight instruc- 
tions may be loaded into instruction queue 22, depend- 
ing upon how many valid instructions are passed by 
cache/memory interface 20 and how much space is 
available within instruction queue 22. 
[001 6] As is typical in such superscalar computer sys- 
tems, instruction queue 22 is utilized to dispatch instruc- 
tions to multiple execution units. As depicted within Fig- 
ure 1 , computer system 1 0 includes a floating point proc- 
essor unit 24, a fixed point processor unit 26, and a 
branch processor unit 28. Thus, instruction queue 22 
may dispatch between zero and three instructions dur- 
ing a single cycle, one to each execution unit. 
[0017] In addition to sequential instructions dis- 
patched from instruction queue 22, so-called "condition- 
al branch instructions" may be loaded into instruction 
queue 22 for execution by the branch processor. A con- 
ditional branch instruction is an instruction which spec- 
ifies an associated conditional branch to be taken within 
the application in response to a selected outcome of 
processing one or more sequential instructions. In an 
effort to minimize run-time delay in a pipelined proces- 
sor system, such as computer system 10, the presence 
of a conditional branch instruction within the instruction 
queue is detected and an outcome of the conditional 
branch is predicted. As should be apparent to those hav- 
ing skill in the art when a conditional branch is predicted 
as "not taken" the sequential instructions within the in- 
struction queue simply continue along a current path 
and no instructions are altered. However, if the predic- 
tion as to the occurrence of the branch is incorrect, the 
instruction queue must be purged of sequential instruc- 
tions, which follow the conditional branch instructions in 
program order and target instructions must be fetched. 
Alternately, if the conditional branch is predicted as "tak- 
en" then the target instructions are fetched and utilized 
to follow the conditional branch, if the prediction is re- 
solved as correct. And of course, if the prediction of "tak- 
en" is incorrect the target instructions must be purged 
and the sequential instructions which follow the condi- 
tional branch instruction in program order must be re- 
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trieved. 

[0018] As illustrated, computer system 10 also pref- 
erably includes a condition registers 32. Condition reg- 
isters 32 are utilized to temporarily store the results of 
various comparisons which may occur utilizing the out- 
come of sequential instructions which are processed 
within computer system 10. Thus, floating point proces- 
sor unit 24 and fixed point processor unit 26 are coupled 
to condition registers 32. The status of a particular con- 
dition within condition registers 32 may be detected and 
coupled to branch processor unit 28 in order to generate 
target addresses, which are then utilized to fetch target 
instructions in response to the occurrence of a condition 
which initiates a branch. 

[0019] Thereafter, branch processor unit 28 couples 
target addresses to fetcher 30. Fetcher 30 calculates 
fetch addresses for the target instructions necessary to 
follow the conditional branch and couples those fetch 
addresses to cache/memory interface 20. As will be ap- 
preciated by those having skill in the art, if the target 
instructions associated with those fetch addresses are 
present within cache/memory interface 20, those target 
instructions are loaded into instruction queue 22. Alter- 
nately, the target instructions may be fetched from mem- 
ory 18 and thereafter loaded into instruction queue 22 
from cache/memory interface 20 after a delay required 
to fetch those target instructions. ( 
[0020] The manipulation of instruction queue content 
in a prior art data processing system utilizing an alter- 
nate instruction queue is illustrated in Figure 2 within 
reservation table 36 therein. Figure 2 depicts a table il- 
lustrating manipulation of instruction queue data content 
through seven consecutive cycle times. It may be seen 
that at cycle time 1 , the instruction queue includes a con- 
ditional branch instruction (be), a compare instruction 
(cmp) and four arithmetic logic unit (alu) instructions. 
Upon the detection of the conditional branch instruction 
within queue 3 of the prior art instruction queue, the se- 
quential instructions within the queue are loaded into an 
alternate instruction queue (not shown). Thereafter, a 
request for target instructions associated with the con- 
ditional branch is initiated at cycle 2 and those instruc- 
tions are loaded into the instruction queue at cycle 3. 
These instructions are based upon the prediction that 
the conditional branch associated with the conditional 
branch instruction will be "taken." 
[0021] Thereafter, at cycle 4, the compare (cmp) in- 
struction has propagated to the execution position within 
the instruction queue and the conditional branch instruc- 
tion is "resolved." In the event the resolution of the con- 
ditional branch instruction indicates that the conditional 
branch is "not taken" the sequential instructions previ- 
ously loaded into the alternate instruction queue are 
once again loaded into the primary instruction queue, 
as depicted at cycle 5. Cycles 6 and 7 within the instruc- 
tion queue of Figure 2 indicate the subsequent process- 
ing of additional sequential instructions. As illustrated, 
only a single empty cycle is present within the instruction 



queue following the misprediction of the conditional 
branch instruction. However, as described above, the 
implementation of this prior art technique requires the 
utilization of an alternate instruction queue. 
s [0022] Referring to Figure 3, a block diagram sche- 
matic of a region of registers is depicted, including gen- 
eral purpose register 40 and three dedicated registers 
used to implement a branch on count instruction with 
predictive branching, speculative execution of instruc- 
ts tions and recovery from speculative execution of in- 
structions along a wrong branch. The three dedicated 
registers include a dispatch version register 42, an up- 
date version register 44 and a completion version reg- 
ister 46. 

15 [0023] Functionally, the dispatch version register 42 
provides a dispatch stage version of the count to be 
used for address generation. The dispatch stage ver- 
sion always loads the completion stage version of the 
count stored in completion version register 46. The com- 

20 pletion stage version count is the count corresponding 
to the last speculatively executed and confirmed instruc- 
tion. The contents of update version register 44 corre- 
spond to the count for an instruction speculatively exe- 
cuted, which will be copied into register 46 when con- 

25 firmed. The purpose of the update version of the count 
is to enable decrementing of the dispatch version before 
the MOVE_TO_COUNT instruction completes execu- 
tion. The MOVE_TO_COUNT instruction may itself be 
speculatively executed. 

30 [0024] The movement of data into registers 42, 44 and 
46 is controlled by occurrence of certain instructions 
from the processor instruction set. The instructions are 
generated by a compiler and decoded by conventional 
control logic of the processor unit. Speculative execu- 

35 tion of a branch of a loop in a program is initiated by 
fixed point unit 26. A store instruction loading a count 
value into general purpose register 40 is executed. Sub- 
sequently execution of a MOVE_TO_COUNT instruc- 
tion begins, resulting in application of a gate control sig- 

40 nal to gates 48 and 50. As a result the contents of gen- 
eral purpose register 40 are copied into dispatch version 
register 42 and update version register 44. 
[0025] Upon dispatch of a BRANCH_ON_COUNT in- 
struction the contents of register 42 are examined to de- 

45 termine if the branch should be taken or if execution of 
steps of the program should fall through sequentially. A 
gate control signal is applied to gates 52 and 56, which 
results in the application of the contents of register 42 
to branch unit 28 and copying into register 42 of the prior 

so contents of the register less 1 by route of decrementer 
54. 

[0026] If an interrupt is taken prior to completion of 
instructions following a BRANCH JDN_COU NT instruc- 
tion, indicating, for example, that the wrong sequence 
55 of instruction was followed, a gate control signal is ap- 
plied to gate 58. Gate 58 is used to copy the contents 
of completion version register 46 into dispatch version 
register 42. This returns the state of dispatch version 
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register 42 to that preceding execution of any specula- 
tive instruction not yet confirmed. 
[0027] As previously described, gate 48 controls the 
copying of the contents of general purpose register 40 
to update version register 44 with initiation of a 
MOVE_TO_COUNT instruction. The contents of update 
register 44 are copied to completion version register 46 
by gate 60 with completion of the MOVE_TO_COUNT 
instruction. There the initial count is preserved until a 
branch following a BRANCH JDN_COU NT instruction is 
completed and confirmed. Upon completion logic of the 
processor signal removing the tentative markings from 
the results of the branch, a gating signal is applied to 
gate 62 resulting in the contents of register 46 being dec- 
remented by decrementer 64 and the result being cop- 
ied back into register 46. 

[0028] Figure 4 illustrates a simplified but lower per- 
formance embodiment of the invention, in which gate 46 
and update version register 44 have been eliminated. 
The movement of data into dispatch version register 42 
and completion version register 46 is controlled by sig- 
nals to various gates. The completion version register 
46 now receives its data from the dispatch version reg- 
ister 42, rather than the update version register. The sig- 
nals are similar to those discussed with reference to Fig- 
ure 3, but occur at somewhat different times. The most 
important change in timing is that for BRANCH_ON_ 
COUNT instructions, which cannot execute until a 
MOVE_TO_COUNT instruction completes and the 
completion version register 46 has been loaded. 
[0029] With initiation of a MOVE_TO_COUNT in- 
struction, a gate signal is applied to gate 50 copying the 
contents of general purpose register 40 into dispatch 
version register 42. When the MOVE_TO_COUNT com- 
pletes a signal applied to gate 60 results in the contents 
of the dispatch version register 42 being applied to com- 
pletion version register 46. Now a DISPATCH_ON_ 
COUNT instruction can be executed. The handling of 
BRANCH_ON_COUNT instructions, interrupts and 
complete BRANCH_ON_COUNT is identical to the first 
embodiment. 



Claims 

1. A processor for a data processing system (10) in- 
cluding a branch unit (28) for operating on condi- 
tional branch instructions and calculating target ad- 
dresses for use in fetching instructions from a mem- 
ory device (1 8), said processor comprising: 

a source (40) of an initial version of a count val- 
ue related to the target address of a conditional 
branch; 

a register (42) for storing a dispatch version of 
a count value; 

a register (46) for storing a completion version 
of a count value corresponding to completed 



and confirmed instructions; 
means (50) responsive to beginning execution 
of a move to count instruction for loading the 
initial version of the count value into said regis- 

5 ter (42) for storing the dispatch version; 

means responsive to completion of a move to 
count instruction for moving the initial version 
of the count value to said register (46) for stor- 
ing the completion version; 

10 means (52,54) responsive to dispatch of a 

branch on count instruction for decrementing 
the dispatch version in said register (42) for 
storing the dispatch version; 
means (62,64) responsive to completion of the 

is branch for decrementing the contents of said 

register (46) for storing the completion version; 
and 

means (58) responsive to an interrupt for load- 
ing the contents of the register for storing the 
20 completion version into the register for storing 

the dispatch version. 

2. A processor as set forth in claim 1, wherein said 
means for moving the initial version of the count val- 

25 ue includes: 

an update register (44) for storing the initial ver- 
sion of the count; and 

a gate (60) for transferring the content of said 
30 update register (44) to said means (46) for stor- 

ing the completion version. 

3. In a data processing system (10) having a branch 
unit (28) in a processor for processing conditional 

35 branch instructions, a method of speculatively exe- 
cuting instructions recovered from memory (18) 
based upon addresses calculated by said branch 
unit (28), the method including the steps of: 

40 initializing a dispatch version register (42) with 

a count value for control of the conditional 
branch; 

upon completion of initialization of said dis- 
patch version register (42), copying the initiali- 
45 zation data to a completion version register 

(46); 

responsive to dispatch of a conditional branch 
instruction, examining contents of said dispatch 
version register (42) to determine a branch to 

50 be taken and then decrementing the contents 

of said dispatch version register (42); 
responsive to completion of the taken branch, 
decrementing contents of said completion ver- 
sion register (46); and 

55 upon occurrence of an interrupt prior to com- 

pletion of the taken branch, copying the con- 
tents of said completion version register (46) to 
said dispatch version register (42). 
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Patentanspruche 

1. Prozessor fur ein Datenverarbeitungssystem (10), 
der eine Verzweigungseinheit (28) zum Betrieb mit 
bedingten Verzweigungsbef ehlen und zum Berech- 
nen von Zieladressen zum Gebrauch in Abf ragebe- 
fehlen von einem Speicherbau element (18) enthalt, 
wobei der Prozessor umfasst: 

eine Quelle (40) einer Anfangsversion eines 
Zahlwertes, der mit der Zieladresse einer be- 
dingten Verzweigung verbunden ist; 

ein Register (42) zum Speichem einer Aus- 
wahlversion eines Zahlwertes; 

ein Register (46) zum Speichern einer Ab- 
schlussversion eines Zahlwertes, der ausge- 
fuhrten und bestatigten Befehlen entspricht; 

Mittel (50), die auf beginnende Ausfuhrung ei- 
ner Ubertragung zum Zahlbefehl zum Laden 
der Anfangsversion des Zahlwertes in das Re- 
gister (42) zum Speichern der Auswahlversion 
reagieren; 

Mittel, die auf den Abschluss einer Ubertragung 
zum Zahlbefehl reagieren, urn die Anfangsver- 
sion des Zahlwertes in das Register (46) zum 
Speichern der Abschlussversion zu ubertra- 
gen; 

Mittel (52, 54), die auf das Auswahlen einer 
Verzweigung am Zahlbefehl reagieren, urn die 
Auswahlversion in dem Register (42) zum 
Speichern der Auswahlversion zu vermindern; 

Mittel (62, 64), die auf den Abschluss der Ver- 
zweigung zum Vermindern der Inhalte des Re- 
gisters (46) zum Speichern der Abschlussver- 
sion reagieren; und 

Mittel (58), die auf eine Unterbrechung zum La- 
den des Inhalts des Registers zum Speichern 
der Abschlussversion in das Register zum 
Speichern der Auswahlversion reagieren. 

2. Prozessor, wie er in Anspruch 1 erlautert ist, wobei 
das Mittel zum Ubertragen der Anfangsversion des 
Zahlwertes enthalt: 

ein Aktualisierungsregister (44) zum Speichern 
der Anfangsversion des Zahlwerts; und 

ein Gatter (60) zum Ubertragen des Inhaltes 
des Aktualisierungsregisters (44) an das Mittel 
(46) zum Speichern der Abschlussversion. 



3. In einem Datenverarbeitungssystem (1 0), das eine 
Verzweigungseinheit (28) in einem Prozessor zum 
Verarbeiten bedingter Verzweigungsbef eh le hat, 
Verfahren der spekulativen Ausfuhrung von Befeh- 
s len, die aus Speicher (18) wiederhergestellt wer- 
den, das auf Adressen beruht, die durch die Ver- 
zweigungseinheit (28) berechnet werden, wobei 
das Verfahren die Schritte enthalt: 

10 Initialisieren eines Auswahlversionsregisters 

(42) mit einem Zahlwert zum Steuern der be- 
dingten Verzweigung; 

Nach Abschluss der Initialisierung des Ver- 
is zweigungsversionsregisters (42) Kopieren der 

Initialisierungsdaten in ein Abschlussversions- 
register (46); 

als Reaktion auf das Auswahlen eines beding- 
20 ten Verzweigungsbefehls Prufen des Inhalts 

des Verzweigungsversionsregisters (42), urn 
eine Verzweigung zu bestimmen, die genom- 
men werden soil, und dann die Inhalte des Aus- 
wahlversionsregisters (42) zu vermindern; 

25 

als Reaktion auf die genommene Verzweigung 
Vermindern der Inhalte des Abschlussversi- 
onsregisters (46); und 

30 beim Auftreten einer Unterbrechung vor dem 

Abschluss der genommenen Verzweigung Ko- 
pieren des Inhalts des Abschlussversionsregi- 
sters (46) in das Auswahlversionsregister (42). 



35 

Revendications 

1 . Processeur pour un systeme de traitement de don- 
nees (10) incluant une unite de branchement (28) 
40 pour agir sur des instructions de branchement con- 
ditionnel et calculer des adresses cibles pour utili- 
sation dans Textraction des instructions a partir d'un 
dispositif memoire (18), ledit processeur 
comprenant : 

45 

une source (40) d'une version initiale d'une va- 
leur de compte liee a I'adresse cible d'un bran- 
chement conditionnel ; 

un registre (42) pour memoriser une version de 
so distribution d'une valeur de compte ; 

un registre (46) pour memoriser une version 
d'achevement d'une valeur de compte corres- 
pondant a des instructions achevees et 
confirmees ; 

55 un moyen (50) sensible au commencement de 

I'execution d'une instruction d6placer pour 
compter pour charger la version initiale de la 
valeur de compte dans ledit registre (42) pour 
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m£moriser la version de distribution ; 
un moyen sensible a I'achevement d'instruction 
d£placer pour compter pour d6placer ladite ver- 
sion initiale de la valeur de compte vers ledit 
registre (46) pour m6moriser la version s 
d'achevement ; 

un moyen (52, 54) sensible a une distribution 
d'une instruction de branchement sur compte 
pour d£cr£menter la version de distribution 
dans ledit registre (42) pour m6moriser la ver- 10 
ston de distribution ; 

un moyen (62, 64) sensible a I'achevement du 
branchement pour d6cr6menter les contenus 
dudit registre (46). pour m6moriser la version 
d'achevement ; et is 
un moyen (58) sensible a une interruption pour 
charger les contenus du registre pour m£mori- 
ser la version d'achevement dans le registre 
pour m6moriser la version de distribution. 

20 

2. Processeur selon la revendication 1 , dans lequel le- 
dit moyen pour dSplacer la version initiale de la va- 
leur de compte inclut : 

un registre de mise a jour (44) pour m6moriser 25 
la version initiale du compte ; et 
une porte (60) pour transferer le contenu dudit 
registre de mise a jour (44) audit moyen (46) 
pour m6moriser la version d'achevement. 

30 

3. Dans un systeme de traitement de donn6es (10) 
comportant une unite* de branchement (28) dans un 
processeur pour traiter des instructions de branche- 
ment conditionnel, un proc6d6 d'ex6cution des- 
tructions de maniere speculative recup6r£es a par- 35 
tir d'une mSmoire (18) bas6e sur des adresses cal- 
culus par ladite unite de branchement (28), le pro- 
cede comprenant les etapes consistant a : 

initialiser un registre de version de distribution 40 
(42) avec une valeur de compte pour la com- 
mande du branchement conditionnel ; 
sur achievement de ('initialisation dudit registre 
de version de distribution (42), copier les don- 
n6es d'initialisation dans un registre de version 45 
d'achevement (46) ; 

en r6ponse a la distribution d'une instruction de 
branchement conditionnel, examiner les conte- 
nus dudit registre de version de distribution (42) 
pour determiner qu'un branchement doit avoir so 
lieu et d6cr6menter ensuite les contenus dudit 
registre de version de distribution (42) ; 
en reponse a I'achevement du branchement, 
d6cr6menter les contenus dudit registre de ver- 
sion d'achevement (46) ; et 55 
sur arriv6e d'une interruption avant achieve- 
ment du branchement effectue\ copier les con- 
tenus dudit registre de version d'achevement 



(46) dans ledit registre de version de distribu 
tion (42). 
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